SEQUENCE LISTING

(C) STRANDEDNESS: single

(A) DESCRIPTION: /desc = "DNA"

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..5499

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:



ATG GAG AGC ACC TCC CCG CGA GGT CTC CGG TGC CCA CAG CTC TGC AGC 48
Met Glu Ser Thr Ser Pro Arg Gly Leu Arg Cys Pro Gln Leu Cys Ser
1 5 10 15

CAC TCT GGC GCC ATG AGA GCG CCG ACC ACC GCT CGC TGC TCC GGA TGC 96
His Ser Gly Ala Met Arg Ala Pro Thr Thr Ala Arg Cys Ser Gly Cys
20 25 30

ATC CAA CGG GTG CGT TGG AGG GGC TTC CTG CCA CTT GTC CTG GCT GTC 144
Ile Gln Arg Val Arg Trp Arg Gly Phe Leu Pro Leu Val Leu Ala Val
35 40 45

TTG ATG GGG ACA AGT CAT GCC CAA CGG GAT TCC ATA GGG AGA TAC GAA 192
Leu Met Gly Thr Ser His Ala Gln Arg Asp Ser Ile Gly Arg Tyr Glu
50 55 60

CCA GCT AGC AGG GAT GCG AAT CGG TTG TGG CAC CCC GTG GGC AGC CAC 240
Pro Ala Ser Arg Asp Ala Asn Arg Leu Trp His Pro Val Gly Ser His
65 70 75 80

CCC GCA GCG GCT GCA GCC AAG GTG TAC AGT CTG TTC CGA GAG CCT GAC 288
Pro Ala Ala Ala Ala Ala Lys Val Tyr Ser Leu Phe Arg Glu Pro Asp
85 90 95



GCG CCG GTC CCC GGC TTG TCG CCC TCT GAG TGG AAC CAG CCG GCC CAG 336
Ala Pro Val Pro Gly Leu Ser Pro Ser Glu Trp Asn Gln Pro Ala Gln
100 105 110

GGG AAC CCG GGA TGG CTC GCA GAG GCC GAG GCC AGG AGG CCA CCT CGA 384
Gly Asn Pro Gly Trp Leu Ala Glu Ala Glu Ala Arg Arg Pro Pro Arg
115 120 125

ACC CAG CAG CTG CGT CGA GTC CAG CCA CCT GTC CAG ACT CGG AGA AGC 432
Thr Gln Gln Leu Arg Arg Val Gln Pro Pro Val Gln Thr Arg Arg Ser
130 135 140

CAT CCC CGG GGC CAG CAG CAG ATA GCA GCC CGG GCT GCA CCT TCT GTC 480
His Pro Arg Gly Gln Gln Gln Ile Ala Ala Arg Ala Ala Pro Ser Val
145 150 155 160

GCG CGC CTG GAA ACC CCT CAG CGA CCC GCG GCT GCA CGG CGA GGG CGG 528
Ala Arg Leu Glu Thr Pro Gln Arg Pro Ala Ala Ala Arg Arg Gly Arg
165 170 175

CTC ACT GGG AGA AAT GTC TGC GGG GGA CAG TGC TGC CCA GGA TGG ACA 576
Leu Thr Gly Arg Asn Val Cys Gly Gly Gln Cys Cys Pro Gly Trp Thr
180 185 190

ACA TCA AAC AGC ACC AAC CAC TGT ATC AAA CCT GTG TGT CAG CCT CCC 624
Thr Ser Asn Ser Thr Asn His Cys Ile Lys Pro Val Cys Gln Pro Pro
195 200 205

TGT CAG AAC CGA GGC TCC TGC AGC AGG CCC CAG GTC TGC ATC TGC CGT 672
Cys Gln Asn Arg Gly Ser Cys Ser Arg Pro Gln Val Cys Ile Cys Arg
210 215 220

TCT GGC TTC CGT GGG GCG CGC TGT GAG GAG GTC ATC CCT GAG GAG GAA 720
Ser Gly Phe Arg Gly Ala Arg Cys Glu Glu Val Ile Pro Glu Glu Glu
225 230 235 240

TTT GAC CCT CAG AAT GCC AGG CCT GTG CCC AGA CGC TCA GTG GAG AGA 768
Phe Asp Pro Gln Asn Ala Arg Pro Val Pro Arg Arg Ser Val Glu Arg
245 250 255

GCA CCC GGT CCT CAC AGA AGC AGT GAG GCC AGA GGA AGT CTA GTG ACC 816
Ala Pro Gly Pro His Arg Ser Ser Glu Ala Arg Gly Ser Leu Val Thr
260 265 270

AGA ATA CAG CCG CTG GTA CCA CCA CCA TCA CCA CCT CCA TCT CGG CGC 864
Arg Ile Gln Pro Leu Val Pro Pro Pro Ser Pro Pro Pro Ser Arg Arg
275 280 285

CTC AGC CAG CCC TGG CCC CTG CAG CAG CAC TCA GGG CCG TCC AGG ACA 912
Leu Ser Gln Pro Trp Pro Leu Gln Gln His Ser Gly Pro Ser Arg Thr
290 295 300

GTT CGT CGG TAT CCG GCC ACT GGT GCC AAT GGC CAG CTG ATG TCC AAC 960
Val Arg Arg Tyr Pro Ala Thr Gly Ala Asn Gly Gln Leu Met Ser Asn
305 310 315 320

GCT TTG CCT TCA GGA CTC GAG CTG AGA GAC AGC AGC CCA CAG GCA GCA 1008
Ala Leu Pro Ser Gly Leu Glu Leu Arg Asp Ser Ser Pro Gln Ala Ala
325 330 335

CAT GTG AAC CAT CTC TCA CCC CCC TGG GGG CTG AAC CTC ACC GAG AAA 1056
His Val Asn His Leu Ser Pro Pro Trp Gly Leu Asn Leu Thr Glu Lys
340 345 350

ATC AAG AAA ATC AAA GTC GTC TTC ACC CCC ACC ATC TGC AAG CAG ACC 1104
Ile Lys Lys Ile Lys Val Val Phe Thr Pro Thr Ile Cys Lys Gln Thr
355 360 365

TGT GCC CGG GGA CGC TGT GCC AAC AGC TGT GAG AAG GGT GAC ACC ACC 1152
Cys Ala Arg Gly Arg Cys Ala Asn Ser Cys Glu Lys Gly Asp Thr Thr
370 375 380

ACC TTG TAC AGT CAG GGT GGC CAT GGG CAT GAC CCC AAG TCT GGC TTC 1200
Thr Leu Tyr Ser Gln Gly Gly His Gly His Asp Pro Lys Ser Gly Phe
385 390 395 400

CGT ATC TAT TTC TGC CAA ATC CCC TGC CTG AAT GGT GGC CGC TGC ATC 1248
Arg Ile Tyr Phe Cys Gln Ile Pro Cys Leu Asn Gly Gly Arg Cys Ile
405 410 415






GGC CGG GAC GAG TGC TGG TGT CCA GCC AAC TCC ACA GGA AAG TTC TGC 1296
Gly Arg Asp Glu Cys Trp Cys Pro Ala Asn Ser Thr Gly Lys Phe Cys
420 425 430

CAT CTG CCT GTC CCG CAG CCA GAC AGG GAA CCT GCA GGG CGA GGT TCC 1344
His Leu Pro Val Pro Gln Pro Asp Arg Glu Pro Ala Gly Arg Gly Ser
435 440 445

CGG CAC AGA ACC CTG CTG GAA GGT CCC CTG AAG CAA TCC ACC TTC ACG 1392
Arg His Arg Thr Leu Leu Glu Gly Pro Leu Lys Gln Ser Thr Phe Thr
450 455 460

CTG CCT CTC TCT AAC CAG CTC GCC TCT GTG AAC CCC TCG CTG GTG AAG 1440
Leu Pro Leu Ser Asn Gln Leu Ala Ser Val Asn Pro Ser Leu Val Lys
465 470 475 480

GTG CAA ATT CAT CAC CCG CCT GAG GCC TCT GTG CAG ATT CAC CAG GTG 1488
Val Gln Ile His His Pro Pro Glu Ala Ser Val Gln Ile His Gln Val
485 490 495

GCC CGG GTC CGG GGT GAG CTG GAC CCC GTG CTG GAG GAC AAC AGT GTG 1536
Ala Arg Val Arg Gly Glu Leu Asp Pro Val Leu Glu Asp Asn Ser Val
500 505 510

GAG ACC AGA GCC TCT CAT CGC CCC CAC GGC AAC CTA GGC CAC AGC CCC 1584
Glu Thr Arg Ala Ser His Arg Pro His Gly Asn Leu Gly His Ser Pro
515 520 525

TGG GCC AGC AAC AGC ATA CCC GCT CGG GCC GGA GAG GCC CCT CGG CCA 1632
Trp Ala Ser Asn Ser Ile Pro Ala Arg Ala Gly Glu Ala Pro Arg Pro
530 535 540

CCA CCA GTG CTG TCT AGG CAT TAT GGA CTT CTC GGC CAG TGT TAC CTG 1680
Pro Pro Val Leu Ser Arg His Tyr Gly Leu Leu Gly Gln Cys Tyr Leu
545 550 555 560

AGC ACG GTG AAT GGA CAG TGT GCT AAC CCC CTA GGT AGT CTG ACT TCT 1728
Ser Thr Val Asn Gly Gln Cys Ala Asn Pro Leu Gly Ser Leu Thr Ser
565 570 575

CAG GAG GAC TGC TGT GGC AGT GTG GGG ACC TTC TGG GGG GTG ACC TCC 1776
Gln Glu Asp Cys Cys Gly Ser Val Gly Thr Phe Trp Gly Val Thr Ser
580 585 590

TGT GCT CCC TGC CCA CCC AGA CAA GAG GGT CCA GCC TTC CCA GTG ATT 1824
Cys Ala Pro Cys Pro Pro Arg Gln Glu Gly Pro Ala Phe Pro Val Ile
595 600 605

GAA AAT GGC CAG CTG GAG TGT CCC CAA GGA TAC AAG AGA CTG AAC CTC 1872
Glu Asn Gly Gln Leu Glu Cys Pro Gln Gly Tyr Lys Arg Leu Asn Leu
610 615 620

AGC CAC TGC CAA GAT ATC AAT GAG TGC CTG ACC CTG GGC CTC TGC AAG 1920
Ser His Cys Gln Asp Ile Asn Glu Cys Leu Thr Leu Gly Leu Cys Lys
625 630 635 640

GAC TCG GAG TGC GTG AAC ACC AGG GGC AGC TAC CTG TGC ACC TGC AGG 1968
Asp Ser Glu Cys Val Asn Thr Arg Gly Ser Tyr Leu Cys Thr Cys Arg
645 650 655

CCT GGC CTC ATG CTG GAT CCG TCA AGG AGC CGC TGC GTA TCG GAC AAG 2016
Pro Gly Leu Met Leu Asp Pro Ser Arg Ser Arg Cys Val Ser Asp Lys
660 665 670

GCT GTC TCC ATG CAG CAG GGA CTA TGC TAC CGG TCA CTG GGG TCT GGT 2064
Ala Val Ser Met Gln Gln Gly Leu Cys Tyr Arg Ser Leu Gly Ser Gly
675 680 685

ACC TGC ACC CTG CCT TTG GTT CAT CGG ATC ACC AAG CAG ATA TGC TGC 2112
Thr Cys Thr Leu Pro Leu Val His Arg Ile Thr Lys Gln Ile Cys Cys
690 695 700

TGC AGC CGT GTG GGC AAA GCC TGG GGT AGC ACA TGT GAA CAG TGT CCC 2160
Cys Ser Arg Val Gly Lys Ala Trp Gly Ser Thr Cys Glu Gln Cys Pro
705 710 715 720

CTG CCT GGC ACA GAA GCC TTC AGG GAG ATC TGC CCT GCT GGC CAT GGC 2208
Leu Pro Gly Thr Glu Ala Phe Arg Glu Ile Cys Pro Ala Gly His Gly
725 730 735

TAC ACC TAC TCG AGC TCA GAC ATC CGC CTG TCT ATG AGG AAA GCC GAA 2256
Tyr Thr Tyr Ser Ser Ser Asp Ile Arg Leu Ser Met Arg Lys Ala Glu
740 745 750

GAA GAG GAA CTG GCT AGC CCC TTA AGG GAG CAG ACA GAG CAG AGC ACT 2304
Glu Glu Glu Leu Ala Ser Pro Leu Arg Glu Gln Thr Glu Gln Ser Thr
755 760 765

GCA CCC CCA CCT GGG CAA GCA GAG AGG CAA CCA CTC CGG GCA GCC ACC 2352
Ala Pro Pro Pro Gly Gln Ala Glu Arg Gln Pro Leu Arg Ala Ala Thr
770 775 780

GCC ACC TGG ATT GAG GCT GAG ACC CTC CCT GAC AAA GGT GAC TCT CGG 2400
Ala Thr Trp Ile Glu Ala Glu Thr Leu Pro Asp Lys Gly Asp Ser Arg
785 790 795 800

GCT GTT CAG ATC ACA ACC AGT GCT CCC CAC CTA CCT GCC CGG GTA CCA 2448
Ala Val Gln Ile Thr Thr Ser Ala Pro His Leu Pro Ala Arg Val Pro
805 810 815

GGG GAT GCC ACT GGA AGA CCA GCA CCA TCC TTG CCT GGA CAG GGC ATT 2496
Gly Asp Ala Thr Gly Arg Pro Ala Pro Ser Leu Pro Gly Gln Gly Ile
820 825 830

CCA GAG AGT CCA GCA GAA GAG CAA GTG ATT CCC TCC AGT GAT GTC TTG 2544
Pro Glu Ser Pro Ala Glu Glu Gln Val Ile Pro Ser Ser Asp Val Leu
835 840 845

GTG ACA CAC AGC CCC CCA GAC TTT GAT CCA TGT TTT GCT GGA GCC TCC 2592
Val Thr His Ser Pro Pro Asp Phe Asp Pro Cys Phe Ala Gly Ala Ser
850 855 860

AAC ATC TGT GGC CCT GGG ACC TGT GTG AGC CTC CCA AAT GGA TAC AGA 2640
Asn Ile Cys Gly Pro Gly Thr Cys Val Ser Leu Pro Asn Gly Tyr Arg
865 870 875 880

TGT GTC TGC AGC CCT GGC TAC CAG CTA CAC CCC AGC CAA GAC TAC TGT 2688
Cys Val Cys Ser Pro Gly Tyr Gln Leu His Pro Ser Gln Asp Tyr Cys
885 890 895

ACT GAT GAC AAC GAG TGT ATG AGG AAC CCC TGT GAA GGA AGA GGG CGC 2736
Thr Asp Asp Asn Glu Cys Met Arg Asn Pro Cys Glu Gly Arg Gly Arg
900 905 910

TGT GTC AAC AGT GTG GGC TCC TAC TCC TGC CTC TGC TAT CCT GGC TAC 2784
Cys Val Asn Ser Val Gly Ser Tyr Ser Cys Leu Cys Tyr Pro Gly Tyr
915 920 925

ACA CTA GTC ACC CTC GGA GAC ACA CAG GAG TGC CAA GAT ATC GAT GAG 2832
Thr Leu Val Thr Leu Gly Asp Thr Gln Glu Cys Gln Asp Ile Asp Glu
930 935 940

TGT GAG CAG CCC GGG GTG TGC AGT GGT GGG CGA TGC AGC AAC ACG GAG 2880
Cys Glu Gln Pro Gly Val Cys Ser Gly Gly Arg Cys Ser Asn Thr Glu
945 950 955 960

GGC TCG TAC CAC TGC GAG TGT GAT CGG GGC TAC ATC ATG GTC AGG AAA 2928
Gly Ser Tyr His Cys Glu Cys Asp Arg Gly Tyr Ile Met Val Arg Lys
965 970 975

GGA CAC TGT CAA GAT ATC AAC GAA TGC CGT CAC CCT GGT ACC TGC CCT 2976
Gly His Cys Gln Asp Ile Asn Glu Cys Arg His Pro Gly Thr Cys Pro
980 985 990

GAT GGG AGA TGC GTC AAC TCC CCT GGC TCC TAC ACT TGT CTG GCC TGT 3024
Asp Gly Arg Cys Val Asn Ser Pro Gly Ser Tyr Thr Cys Leu Ala Cys
995 1000 1005

GAG GAG GGC TAT GTA GGC CAG AGT GGG AGC TGT GTA GAT GTC AAT GAG 3072
Glu Glu Gly Tyr Val Gly Gln Ser Gly Ser Cys Val Asp Val Asn Glu
1010 1015 1020

TGT CTG ACC CCT GGG ATA TGT ACC CAT GGA AGG TGC ATC AAC ATG GAA 3120
Cys Leu Thr Pro Gly Ile Cys Thr His Gly Arg Cys Ile Asn Met Glu
1025 1030 1035 1040

GGC TCC TTT AGA TGC TCC TGT GAG CCG GGC TAT GAG GTC ACC CCA GAC 3168
Gly Ser Phe Arg Cys Ser Cys Glu Pro Gly Tyr Glu Val Thr Pro Asp
1045 1050 1055

AAG AAG GGC TGC CGA GAT GTG GAC GAG TGT GCC AGC CGA GCC TCG TGC 3216
Lys Lys Gly Cys Arg Asp Val Asp Glu Cys Ala Ser Arg Ala Ser Cys
1060 1065 1070

CCC ACG GGC CTC TGC CTC AAC ACG GAG GGC TCC TTC ACC TGC TCA GCC 3264
Pro Thr Gly Leu Cys Leu Asn Thr Glu Gly Ser Phe Thr Cys Ser Ala
1075 1080 1085

TGT CAG AGC GGG TAC TGG GTG AAC GAA GAT GGC ACT GCC TGT GAA GAC 3312
Cys Gln Ser Gly Tyr Trp Val Asn Glu Asp Gly Thr Ala Cys Glu Asp
1090 1095 1100

TTG GAT GAA TGT GCC TTC CCT GGA GTC TGC CCC ACA GGC GTC TGC ACC 3360
Leu Asp Glu Cys Ala Phe Pro Gly Val Cys Pro Thr Gly Val Cys Thr
1105 1110 1115 1120

AAT ACT GTA GGC TCC TTC TCC TGC AAG GAC TGT GAC CAG GGC TAC CGG 3408
Asn Thr Val Gly Ser Phe Ser Cys Lys Asp Cys Asp Gln Gly Tyr Arg
1125 1130 1135

CCC AAC CCC CTG GGC AAC AGA TGC GAA GAT GTG GAT GAG TGT GAA GGT 3456
Pro Asn Pro Leu Gly Asn Arg Cys Glu Asp Val Asp Glu Cys Glu Gly
1140 1145 1150

CCC CAA AGC AGC TGC CGG GGA GGC GAA TGC AAG AAC ACA GAA GGT TCC 3504
Pro Gln Ser Ser Cys Arg Gly Gly Glu Cys Lys Asn Thr Glu Gly Ser
1155 1160 1165

TAC CAA TGC CTC TGT CAC CAG GGC TTC CAG CTG GTC AAT GGC ACC ATG 3552
Tyr Gln Cys Leu Cys His Gln Gly Phe Gln Leu Val Asn Gly Thr Met
1170 1175 1180

TGT GAG GAC GTG AAT GAG TGT GTT GGG GAA GAG CAT 3600
Cys Glu Asp Val Asn Glu Cys Val Gly Glu Glu His
1185 1190 1195 1200

GGC GAG TGC CTC AAC AGC CTG GGC TCC TTC TTC TGC 3648
Gly Glu Cys Leu Asn Ser Leu Gly Ser Phe Phe Cys
1205 1210 1215

GGC TTT GCT AGT GCT GAG GGG GGC ACC AGA TGC CAG 3696
Gly Phe Ala Ser Ala Glu Gly Gly Thr Arg Cys Gln
1220 1225 1230

TGT GCA GCC ACA GAC CCG TGT CCG GGA GGA CAC TGT 3744
Cys Ala Ala Thr Asp Pro Cys Pro Gly Gly His Cys
1235 1240 1245

GGC TCC TTC AGC TGT CTG TGT GAG ACT GCT TCC TTC 3792
Gly Ser Phe Ser Cys Leu Cys Glu Thr Ala Ser Phe
1250 1255 1260

GAC AGC GGA GAA TGT TTG GAT ATT GAT GAG TGT GAG 3840
Asp Ser Gly Glu Cys Leu Asp Ile Asp Glu Cys Glu
1265 1270 1275 1280

CCG GTG TGC GGA GCC TGG AGG TGT GAG AAC AGT CCT GGT TCC TAC CGC 3888
Pro Val Cys Gly Ala Trp Arg Cys Glu Asn Ser Pro Gly Ser Tyr Arg
1285 1290 1295

TGC ATC CTG GAC TGC CAG CCT GGA TTC TAT GTG GCG CCA AAT GGA GAC 3936
Cys Ile Leu Asp Cys Gln Pro Gly Phe Tyr Val Ala Pro Asn Gly Asp
1300 1305 1310

TGC ATT GAC ATA GAT GAA TGT GCC AAT GAC ACT GTG TGT GGG AAC CAT 3984
Cys Ile Asp Ile Asp Glu Cys Ala Asn Asp Thr Val Cys Gly Asn His
1315 1320 1325

GGC TTC TGT GAC AAC ACG GAC GGC TCC TTC CGC TGC CTG TGT GAC CAG 4032
Gly Phe Cys Asp Asn Thr Asp Gly Ser Phe Arg Cys Leu Cys Asp Gln
1330 1335 1340

GGC TTC GAG ACC TCA CCA TCA GGC TGG GAG TGT GT^ GAT GTG AAC GAG 4080
Gly Phe Glu Thr Ser Pro Ser Gly Trp Glu Cys Val Asp Val Asn Glu
1345 1350 1355 1360

TGT GAG CTC ATG ATG GCA GTG TGT GGG GAT GCG CTC TGT GAG AAC GTG 4128
Cys Glu Leu Met Met Ala Val Cys Gly Asp Ala Leu Cys Glu Asn Val
1365 1370 1375

GAA GGC TCC TTC CTG TGC CTT TGC GCC AGT GAC CTT GAG GAG TAC GAC 4176
Glu Gly Ser Phe Leu Cys Leu Cys Ala Ser Asp Leu Glu Glu Tyr Asp
1380 1385 1390

GCA GAA GAA GGA CAC TGC CGT CCT CGG GTG GCT GGA GCT CAG AGA ATC 4224
Ala Glu Glu Gly His Cys Arg Pro Arg Val Ala Gly Ala Gln Arg Ile
1395 1400 1405



CCA GAG GTC CGG ACA GAG GAC CAG GCT CCA AGC CTT ATC CGC ATG GAA 4272
Pro Glu Val Arg Thr Glu Asp Gln Ala Pro Ser Leu Ile Arg Met Glu
1410 1415 1420

TGC TAC TCT GAA CAC AAT GGT GGT CCT CCC TGC TCT CAA ATC CTG GGC 4320
Cys Tyr Ser Glu His Asn Gly Gly Pro Pro Cys Ser Gln Ile Leu Gly
1425 1430 1435 1440

CAG AAC TCC ACA CAG GCC GAG TGC TGC TGC ACT CAG GGT GCC AGA TGG 4368
Gln Asn Ser Thr Gln Ala Glu Cys Cys Cys Thr Gln Gly Ala Arg Trp
1445 1450 1455

GGA AAG GCC TGT GCG CCC TGC CCA TCT GAG GAC TCA GTT GAA TTC AGT 4416
Gly Lys Ala Cys Ala Pro Cys Pro Ser Glu Asp Ser Val Glu Phe Ser
1460 1465 1470

CAG CTC TGC CCC AGT GGT CAA GGT TAC ATC CCA GTG GAA GGA GCC TGG 4464
Gln Leu Cys Pro Ser Gly Gln Gly Tyr Ile Pro Val Glu Gly Ala Trp
1475 1480 1485









ACA TTT GGA CAA ACC ATG TAT ACA GAT GCC GAT GAA TGT GTA CTG TTT 4512
Thr Phe Gly Gln Thr Met Tyr Thr Asp Ala Asp Glu Cys Val Leu Phe
1490 1495 1500

GGG CCT GCT CTC TGC CAG AAT GGC CGA TGC TCA AAC ATA GTG CCT GGC 4560
Gly Pro Ala Leu Cys Gln Asn Gly Arg Cys Ser Asn Ile Val Pro Gly
1505 1510 1515 1520

TAC ATT TGC CTG TGC AAC CCT GGC TAC CAC TAT GAT GCC TCC AGC AGG 4608
Tyr Ile Cys Leu Cys Asn Pro Gly Tyr His Tyr Asp Ala Ser Ser Arg
1525 1530 1535

AAG TGC CAG GAT CAC AAC GAA TGC CAG GAC TTG GCC TGT GAG AAC GGT 4656
Lys Cys Gln Asp His Asn Glu Cys Gln Asp Leu Ala Cys Glu Asn Gly
1540 1545 1550



GAG TGT GTG AAC CAA GAA GGC TCC TTC CAT TGC CTC TGC AAT CCC CCC 4704
Glu Cys Val Asn Gln Glu Gly Ser Phe His Cys Leu Cys Asn Pro Pro
1555 1560 1565

CTC ACC CTA GAC CTC AGT GGG CAG CGC TGT GTG AAC ACG ACC AGC AGC 4752
Leu Thr Leu Asp Leu Ser Gly Gln Arg Cys Val Asn Thr Thr Ser Ser
1570 1575 1580

ACG GAG GAC TTC CCT GAC CAT GAC ATC CAC ATG GAC ATC TGC TGG AAA 4800
Thr Glu Asp Phe Pro Asp His Asp Ile His Met Asp Ile Cys Trp Lys
1585 1590 1595 1600

AAA GTC ACC AAT GAT GTG TGC AGC CAG CCC TTG CGT GGG CAC CAT ACC 4848
Lys Val Thr Asn Asp Val Cys Ser Gln Pro Leu Arg Gly His His Thr
1605 1610 1615

ACC TAT ACA GAA TGC TGC TGC CAA GAT GGG GAG GCC TGG AGC CAG CAA 4896
Thr Tyr Thr Glu Cys Cys Cys Gln Asp Gly Glu Ala Trp Ser Gln Gln
1620 1625 1630

TGC GCT CTG TGC CCG CCC AGG AGC TCT GAG GTC TAC GCT CAG CTG TGC 4944
Cys Ala Leu Cys Pro Pro Arg Ser Ser Glu Val Tyr Ala Gln Leu Cys
1635 1640 1645

AAC GTG GCT CGG ATT GAG GCA GAG CGC GGA GCA GGG ATC CAC TTC CGG 4992
Asn Val Ala Arg Ile Glu Ala Glu Arg Gly Ala Gly Ile His Phe Arg
1650 1655 1660

CCA GGC TAT GAG TAT GGC CCT GGC CTG GAC GAT CTG CCT GAA AAC CTC 5040
Pro Gly Tyr Glu Tyr Gly Pro Gly Leu Asp Asp Leu Pro Glu Asn Leu
1665 1670 1675 1680

TAC GGC CCA GAT GGG GCT CCC TTC TAT AAC TAC CTA GGC CCC GAG GAC 5088
Tyr Gly Pro Asp Gly Ala Pro Phe Tyr Asn Tyr Leu Gly Pro Glu Asp
1685 1690 1695

ACT GCC CCT GAG CCT CCC TTC TCC AAC CCA GCC AGC CAG CCG GGA GAC 5136
Thr Ala Pro Glu Pro Pro Phe Ser Asn Pro Ala Ser Gln Pro Gly Asp
1700 1705 1710

AAC ACA CCT GTC CTT GAG CCT CCT CTG CAG CCC TCT GAA CTT CAG CCT 5184
Asn Thr Pro Val Leu Glu Pro Pro Leu Gln Pro Ser Glu Leu Gln Pro
1715 1720 1725

CAC TAT CTA GCC AGC CAC TCA GAA CCC CCT GCC TCC TTC GAA GGC CTT 5232
His Tyr Leu Ala Ser His Ser Glu Pro Pro Ala Ser Phe Glu Gly Leu
1730 1735 1740

CAG GCT GAG GAA TGT GGC ATC CTG AAT GGC TGT GAG AAT GGC CGC TGC 5280
Gln Ala Glu Glu Cys Gly Ile Leu Asn Gly Cys Glu Asn Gly Arg Cys
1745 1750 1755 1760

GTG CGT GTG CGG GAG GGC TAC ACT TGC GAC TGC TTT GAG GGC TTC CAG 5328
Val Arg Val Arg Glu Gly Tyr Thr Cys Asp Cys Phe Glu Gly Phe Gln
1765 1770 1775



CTG GAT GCG CCC ACA TTG GCC TGT GTG GAT GTG AAC GAG TGT GAA GAC 5376
Leu Asp Ala Pro Thr Leu Ala Cys Val Asp Val Asn Glu Cys Glu Asp
1780 1785 1790

TTG AAC GGG CCT GCA CGA CTC TGT GCA CAC GGT CAC TGT GAG AAC ACA 5424
Leu Asn Gly Pro Ala Arg Leu Cys Ala His Gly His Cys Glu Asn Thr
1795 1800 1805

GAG GGT TCC TAT CGC TGC CAC TGT TCG CCA GGT TAC GTG GCA GAG CCA 5472
Glu Gly Ser Tyr Arg Cys His Cys Ser Pro Gly Tyr Val Ala Glu Pro
1810 1815 1820

GGC CCC CCA CAC TGT GCG GCC AAG GAG 5499
Gly Pro Pro His Cys Ala Ala Lys Glu
1825 1830



(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1833 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Glu Ser Thr Ser Pro Arg Gly Leu Arg Cys Pro Gln Leu Cys Ser
1 5 10 15

His Ser Gly Ala Met Arg Ala Pro Thr Thr Ala Arg Cys Ser Gly Cys
20 25 30

Ile Gln Arg Val Arg Trp Arg Gly Phe Leu Pro Leu Val Leu Ala Val
35 40 45

Leu Met Gly Thr Ser His Ala Gln Arg Asp Ser Ile Gly Arg Tyr Glu
50 55 60

Pro Ala Ser Arg Asp Ala Asn Arg Leu Trp His Pro Val Gly Ser His
65 70 75 80

Pro Ala Ala Ala Ala Ala Lys Val Tyr Ser Leu Phe Arg Glu Pro Asp
85 90 95

Ala Pro Val Pro Gly Leu Ser Pro Ser Glu Trp Asn Gln Pro Ala Gln
100 105 110

Gly Asn Pro Gly Trp Leu Ala Glu Ala Glu Ala Arg Arg Pro Pro Arg
115 120 125

Thr Gln Gln Leu Arg Arg Val Gln Pro Pro Val Gln Thr Arg Arg Ser
130 135 140

His Pro Arg Gly Gln Gln Gln Ile Ala Ala Arg Ala Ala Pro Ser Val
145 150 155 160

Ala Arg Leu Glu Thr Pro Gln Arg Pro Ala Ala Ala Arg Arg Gly Arg
165 170 175

Leu Thr Gly Arg Asn Val Cys Gly Gly Gln Cys Cys Pro Gly Trp Thr
180 185 190

Thr Ser Asn Ser Thr Asn His Cys Ile Lys Pro Val Cys Gln Pro Pro
195 200 205

Cys Gln Asn Arg Gly Ser Cys Ser Arg Pro Gln Val Cys Ile Cys Arg
210 215 220

Ser Gly Phe Arg Gly Ala Arg Cys Glu Glu Val Ile Pro Glu Glu Glu
225 230 235 240

Phe Asp Pro Gln Asn Ala Arg Pro Val Pro Arg Arg Ser Val Glu Arg
245 250 255

Ala Pro Gly Pro His Arg Ser Ser Glu Ala Arg Gly Ser Leu Val Thr
260 265 270

Arg Ile Gln Pro Leu Val Pro Pro Pro Ser Pro Pro Pro Ser Arg Arg
275 280 285

Leu Ser Gln Pro Trp Pro Leu Gln Gln His Ser Gly Pro Ser Arg Thr
290 295 300

Val Arg Arg Tyr Pro Ala Thr Gly Ala Asn Gly Gln Leu Met Ser Asn
305 310 315 320

Ala Leu Pro Ser Gly Leu Glu Leu Arg Asp Ser Ser Pro Gln Ala Ala
325 330 335

His Val Asn His Leu Ser Pro Pro Trp Gly Leu Asn Leu Thr Glu Lys
340 345 350

Ile Lys Lys Ile Lys Val Val Phe Thr Pro Thr Ile Cys Lys Gln Thr
355 360 365

Cys Ala Arg Gly Arg Cys Ala Asn Ser Cys Glu Lys Gly Asp Thr Thr
370 375 380

Thr Leu Tyr Ser Gln Gly Gly His Gly His Asp Pro Lys Ser Gly Phe
385 390 395 400

Arg Ile Tyr Phe Cys Gln Ile Pro Cys Leu Asn Gly Gly Arg Cys Ile
405 410 415

Gly Arg Asp Glu Cys Trp Cys Pro Ala Asn Ser Thr Gly Lys Phe Cys
420 425 430

His Leu Pro Val Pro Gln Pro Asp Arg Glu Pro Ala Gly Arg Gly Ser
435 440 445

Arg His Arg Thr Leu Leu Glu Gly Pro Leu Lys Gln Ser Thr Phe Thr
450 455 460

Leu Pro Leu Ser Asn Gln Leu Ala Ser Val Asn Pro Ser Leu Val Lys
465 470 475 480

Val Gln Ile His His Pro Pro Glu Ala Ser Val Gln Ile His Gln Val
485 490 495

Ala Arg Val Arg Gly Glu Leu Asp Pro Val Leu Glu Asp Asn Ser Val
500 505 510

Glu Thr Arg Ala Ser His Arg Pro His Gly Asn Leu Gly His Ser Pro
515 520 525

Trp Ala Ser Asn Ser Ile Pro Ala Arg Ala Gly Glu Ala Pro Arg Pro
530 535 540

Pro Pro Val Leu Ser Arg His Tyr Gly Leu Leu Gly Gln Cys Tyr Leu
545 550 555 560

Ser Thr Val Asn Gly Gln Cys Ala Asn Pro Leu Gly Ser Leu Thr Ser
565 570 575

Gln Glu Asp Cys Cys Gly Ser Val Gly Thr Phe Trp Gly Val Thr Ser
580 585 590






Cys Ala Pro Cys Pro Pro Arg Gln Glu Gly Pro Ala Phe Pro Val Ile
595 600 605

Glu Asn Gly Gln Leu Glu Cys Pro Gln Gly Tyr Lys Arg Leu Asn Leu
610 615 620

Ser His Cys Gln Asp Ile Asn Glu Cys Leu Thr Leu Gly Leu Cys Lys
625 630 635 640

Asp Ser Glu Cys Val Asn Thr Arg Gly Ser Tyr Leu Cys Thr Cys Arg
645 650 655

Pro Gly Leu Met Leu Asp Pro Ser Arg Ser Arg Cys Val Ser Asp Lys
660 665 670

Ala Val Ser Met Gln Gln Gly Leu Cys Tyr Arg Ser Leu Gly Ser Gly
675 680 685

Thr Cys Thr Leu Pro Leu Val His Arg Ile Thr Lys Gln Ile Cys Cys
690 695 700

Cys Ser Arg Val Gly Lys Ala Trp Gly Ser Thr Cys Glu Gln Cys Pro
705 710 715 720

Leu Pro Gly Thr Glu Ala Phe Arg Glu Ile Cys Pro Ala Gly His Gly
725 730 735

Tyr Thr Tyr Ser Ser Ser Asp Ile Arg Leu Ser Met Arg Lys Ala Glu
740 745 750

Glu Glu Glu Leu Ala Ser Pro Leu Arg Glu Gln Thr Glu Gln Ser Thr
755 760 765






Ala Pro Pro Pro Gly Gln Ala Glu Arg Gln Pro Leu Arg Ala Ala Thr
770 775 780

Ala Thr Trp Ile Glu Ala Glu Thr Leu Pro Asp Lys Gly Asp Ser Arg
785 790 795 800

Ala Val Gln Ile Thr Thr Ser Ala Pro His Leu Pro Ala Arg Val Pro
805 810 815

Gly Asp Ala Thr Gly Arg Pro Ala Pro Ser Leu Pro Gly Gln Gly Ile
820 825 830

Pro Glu Ser Pro Ala Glu Glu Gln Val Ile Pro Ser Ser Asp Val Leu
835 840 845

Val Thr His Ser Pro Pro Asp Phe Asp Pro Cys Phe Ala Gly Ala Ser
850 855 860

Asn Ile Cys Gly Pro Gly Thr Cys Val Ser Leu Pro Asn Gly Tyr Arg
865 870 875 880

Cys Val Cys Ser Pro Gly Tyr Gln Leu His Pro Ser Gln Asp Tyr Cys
885 890 895

Thr Asp Asp Asn Glu Cys Met Arg Asn Pro Cys Glu Gly Arg Gly Arg
900 905 910

Cys Val Asn Ser Val Gly Ser Tyr Ser Cys Leu Cys Tyr Pro Gly Tyr
915 920 925

Thr Leu Val Thr Leu Gly Asp Thr Gln Glu Cys Gln Asp Ile Asp Glu
930 935 940

Cys Glu Gln Pro Gly Val Cys Ser Gly Gly Arg Cys Ser Asn Thr Glu
945 950 955 960

Gly Ser Tyr His Cys Glu Cys Asp Arg Gly Tyr Ile Met Val Arg Lys
965 970 975

Gly His Cys Gln Asp Ile Asn Glu Cys Arg His Pro Gly Thr Cys Pro
980 985 990

Asp Gly Arg Cys Val Asn Ser Pro Gly Ser Tyr Thr Cys Leu Ala Cys
995 1000 1005

Glu Glu Gly Tyr Val Gly Gln Ser Gly Ser Cys Val Asp Val Asn Glu
1010 1015 1020

Cys Leu Thr Pro Gly Ile Cys Thr His Gly Arg Cys Ile Asn Met Glu
1025 1030 1035 1040

Gly Ser Phe Arg Cys Ser Cys Glu Pro Gly Tyr Glu Val Thr Pro Asp
1045 1050 1055

Lys Lys Gly Cys Arg Asp Val Asp Glu Cys Ala Ser Arg Ala Ser Cys
1060 1065 1070

Pro Thr Gly Leu Cys Leu Asn Thr Glu Gly Ser Phe Thr Cys Ser Ala
1075 1080 1085

Cys Gln Ser Gly Tyr Trp Val Asn Glu Asp Gly Thr Ala Cys Glu Asp
1090 1095 1100

Leu Asp Glu Cys Ala Phe Pro Gly Val Cys Pro Thr Gly Val Cys Thr
1105 1110 1115 1120

Asn Thr Val Gly Ser Phe Ser Cys Lys Asp Cys Asp Gln Gly Tyr Arg
1125 1130 1135

Pro Asn Pro Leu Gly Asn Arg Cys Glu Asp Val Asp Glu Cys Glu Gly
1140 1145 1150

Pro Gln Ser Ser Cys Arg Gly Gly Glu Cys Lys Asn Thr Glu Gly Ser
1155 1160 1165

Tyr Gln Cys Leu Cys His Gln Gly Phe Gln Leu Val Asn Gly Thr Met
1170 1175 1180






Cys Glu Asp Val Asn Glu Cys Val Gly Glu Glu His Cys Ala Pro His
1185 1190 1195 1200

Gly Glu Cys Leu Asn Ser Leu Gly Ser Phe Phe Cys Leu Cys Ala Pro
1205 1210 1215

Gly Phe Ala Ser Ala Glu Gly Gly Thr Arg Cys Gln Asp Val Asp Glu
1220 1225 1230

Cys Ala Ala Thr Asp Pro Cys Pro Gly Gly His Cys Val Asn Thr Glu
1235 1240 1245

Gly Ser Phe Ser Cys Leu Cys Glu Thr Ala Ser Phe Gln Pro Ser Pro
1250 1255 1260

Asp Ser Gly Glu Cys Leu Asp Ile Asp Glu Cys Glu Asp Arg Glu Asp
1265 1270 1275 1280

Pro Val Cys Gly Ala Trp Arg Cys Glu Asn Ser Pro Gly Ser Tyr Arg
1285 1290 1295

Cys Ile Leu Asp Cys Gln Pro Gly Phe Tyr Val Ala Pro Asn Gly Asp
1300 1305 1310

Cys Ile Asp Ile Asp Glu Cys Ala Asn Asp Thr Val Cys Gly Asn His
1315 1320 1325

Gly Phe Cys Asp Asn Thr Asp Gly Ser Phe Arg Cys Leu Cys Asp Gln
1330 1335 1340

Gly Phe Glu Thr Ser Pro Ser Gly Trp Glu Cys Val Asp Val Asn Glu
1345 1350 1355 1360

Cys Glu Leu Met Met Ala Val Cys Gly Asp Ala Leu Cys Glu Asn Val
1365 1370 1375






Glu Gly Ser Phe Leu Cys Leu Cys Ala Ser Asp Leu Glu Glu Tyr Asp
1380 1385 1390

Ala Glu Glu Gly His Cys Arg Pro Arg Val Ala Gly Ala Gln Arg Ile
1395 1400 1405

Pro Glu Val Arg Thr Glu Asp Gln Ala Pro Ser Leu Ile Arg Met Glu
1410 1415 1420

Cys Tyr Ser Glu His Asn Gly Gly Pro Pro Cys Ser Gln Ile Leu Gly
1425 1430 1435 1440

Gln Asn Ser Thr Gln Ala Glu Cys Cys Cys Thr Gln Gly Ala Arg Trp
1445 1450 1455

Gly Lys Ala Cys Ala Pro Cys Pro Ser Glu Asp Ser Val Glu Phe Ser
1460 1465 1470

Gln Leu Cys Pro Ser Gly Gln Gly Tyr Ile Pro Val Glu Gly Ala Trp
1475 1480 1485

Thr Phe Gly Gln Thr Met Tyr Thr Asp Ala Asp Glu Cys Val Leu Phe
1490 1495 1500

Gly Pro Ala Leu Cys Gln Asn Gly Arg Cys Ser Asn Ile Val Pro Gly
1505 1510 1515 1520

Tyr Ile Cys Leu Cys Asn Pro Gly Tyr His Tyr Asp Ala Ser Ser Arg
1525 1530 1535

Lys Cys Gln Asp His Asn Glu Cys Gln Asp Leu Ala Cys Glu Asn Gly
1540 1545 1550

Glu Cys Val Asn Gln Glu Gly Ser Phe His Cys Leu Cys Asn Pro Pro
1555 1560 1565

Leu Thr Leu Asp Leu Ser Gly Gln Arg Cys Val Asn Thr Thr Ser Ser
1570 1575 1580

Thr Glu Asp Phe Pro Asp His Asp Ile His Met Asp Ile Cys Trp Lys
1585 1590 1595 1600

Lys Val Thr Asn Asp Val Cys Ser Gln Pro Leu Arg Gly His His Thr
1605 1610 1615

Thr Tyr Thr Glu Cys Cys Cys Gln Asp Gly Glu Ala Trp Ser Gln Gln
1620 1625 1630

Cys Ala Leu Cys Pro Pro Arg Ser Ser Glu Val Tyr Ala Gln Leu Cys
1635 1640 1645

Asn Val Ala Arg Ile Glu Ala Glu Arg Gly Ala Gly Ile His Phe Arg
1650 1655 1660

Pro Gly Tyr Glu Tyr Gly Pro Gly Leu Asp Asp Leu Pro Glu Asn Leu
1665 1670 1675 1680






Tyr Gly Pro Asp Gly Ala Pro Phe Tyr Asn Tyr Leu Gly Pro Glu Asp
1685 1690 1695

Thr Ala Pro Glu Pro Pro Phe Ser Asn Pro Ala Ser Gln Pro Gly Asp
1700 1705 1710

Asn Thr Pro Val Leu Glu Pro Pro Leu Gln Pro Ser Glu Leu Gln Pro
1715 1720 1725

His Tyr Leu Ala Ser His Ser Glu Pro Pro Ala Ser Phe Glu Gly Leu
1730 1735 1740

Gln Ala Glu Glu Cys Gly Ile Leu Asn Gly Cys Glu Asn Gly Arg Cys
1745 1750 1755 1760

Val Arg Val Arg Glu Gly Tyr Thr Cys Asp Cys Phe Glu Gly Phe Gln
1765 1770 1775

Leu Asp Ala Pro Thr Leu Ala Cys Val Asp Val Asn Glu Cys Glu Asp
1780 1785 1790

Leu Asn Gly Pro Ala Arg Leu Cys Ala His Gly His Cys Glu Asn Thr
1795 1800 1805

Glu Gly Ser Tyr Arg Cys His Cys Ser Pro Gly Tyr Val Ala Glu Pro
1810 1815 1820

Gly Pro Pro His Cys Ala Ala Lys Glu
1825 1830

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3759 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..3759

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATG CGC CAG GCC GGC GGA TTG GGG CTG CTG GCA CTA CTC CTG CTG GCG 48
Met Arg Gln Ala Gly Gly Leu Gly Leu Leu Ala Leu Leu Leu Leu Ala
1 5 10 15

CTG CTG GGC CCC GGC GGC CGA GGG GTG GGC CGG CCG GGC AGC GGG GCA 96
Leu Leu Gly Pro Gly Gly Arg Gly Val Gly Arg Pro Gly Ser Gly Ala
20 25 30

CAG GCG GGG GCG GGG CGC TGG GCC CAA CGC TTC AAG GTG GTC TTT GCG 144
Gln Ala Gly Ala Gly Arg Trp Ala Gln Arg Phe Lys Val Val Phe Ala
35 40 45

CCT GTG ATC TGC AAG CGG ACC TGT CTG AAG GGC CAG TGT CGG GAC AGC 192
Pro Val Ile Cys Lys Arg Thr Cys Leu Lys Gly Gln Cys Arg Asp Ser
50 55 60

TGT CAG CAG GGC TCC AAC ATG ACG CTC ATC GGA GAG AAC GGC CAC AGC 240
Cys Gln Gln Gly Ser Asn Met Thr Leu Ile Gly Glu Asn Gly His Ser
65 70 75 80

ACC GAC ACG CTC ACC GGT TCT GCC TTC CGC GTG GTG GTG TGC CCT CTA 288
Thr Asp Thr Leu Thr Gly Ser Ala Phe Arg Val Val Val Cys Pro Leu
85 90 95

CCC TGC ATG AAC GGT GGC CAG TGC TCT TCC CGA AAC CAG TGC CTG TGT 336
Pro Cys Met Asn Gly Gly Gln Cys Ser Ser Arg Asn Gln Cys Leu Cys
100 105 110

CCC CCG GAT TTC ACG GGG CGC TTC TGC CAG GTG CCT GCT GCA GGA ACC 384
Pro Pro Asp Phe Thr Gly Arg Phe Cys Gln Val Pro Ala Ala Gly Thr
115 120 125






GGA GCT GGC ACC GGG AGT TCA GGC CCC GGC TGG CCC GAC CGG GCC ATG 432
Gly Ala Gly Thr Gly Ser Ser Gly Pro Gly Trp Pro Asp Arg Ala Met
130 135 140

TCC ACA GGC CCG CTG CCG CCC CTT GCC CCA GAA GGA GAG TCT GTG GCT 480
Ser Thr Gly Pro Leu Pro Pro Leu Ala Pro Glu Gly Glu Ser Val Ala
145 150 155 160

AGC AAA CAC GCC ATT TAC GCG GTG CAG GTG ATC GCA GAT CCT CCC GGG 528
Ser Lys His Ala Ile Tyr Ala Val Gln Val Ile Ala Asp Pro Pro Gly
165 170 175

CCG GGG GAG GGT CCT CCT GCA CAA CAT GCA GCC TTC TTG GTG CCC CTG 576
Pro Gly Glu Gly Pro Pro Ala Gln His Ala Ala Phe Leu Val Pro Leu
180 185 190

GGG CCA GGA CAA ATC TCG GCA GAA GTG CAG GCT CCG CCC CCC GTG GTG 624
Gly Pro Gly Gln Ile Ser Ala Glu Val Gln Ala Pro Pro Pro Val Val
195 200 205

AAC GTG CGT GTC CAT CAC CCT CCT GAA GCT TCC GTT CAG GTG CAC CGC 672
Asn Val Arg Val His His Pro Pro Glu Ala Ser Val Gln Val His Arg
210 215 220

ATC GAG GGG CCG AAC GCT GAA GGC CCA GCC TCT TCC CAG CAC TTG CTG 720
Ile Glu Gly Pro Asn Ala Glu Gly Pro Ala Ser Ser Gln His Leu Leu
225 230 235 240

CCG CAT CCC AAG CCC CCG CAC CCG AGG CCA CCC ACT CAA AAG CCA CTG 768
Pro His Pro Lys Pro Pro His Pro Arg Pro Pro Thr Gln Lys Pro Leu
245 250 255

GGC CGC TGC TTC CAG GAC ACA TTG CCC AAG CAG CCT TGT GGC AGC AAC 816
Gly Arg Cys Phe Gln Asp Thr Leu Pro Lys Gln Pro Cys Gly Ser Asn
260 265 270






CCT TTG CCT GGC CTT ACC AAG CAG GAA GAT TGC TGC GGT AGC ATC GGT 864
Pro Leu Pro Gly Leu Thr Lys Gln Glu Asp Cys Cys Gly Ser Ile Gly
275 280 285

ACT GCC TGG GGA CAA AGC AAG TGT CAC AAG TGC CCA CAG CTT CAG TAT 912
Thr Ala Trp Gly Gln Ser Lys Cys His Lys Cys Pro Gln Leu Gln Tyr
290 295 300

ACA GGG GTG CAG AAG CCT GTA CCT GTA CGT GGG GAG GTG GGT GCT GAC 960
Thr Gly Val Gln Lys Pro Val Pro Val Arg Gly Glu Val Gly Ala Asp
305 310 315 320

TGC CCC CAG GGC TAC AAG AGG CTC AAC AGC ACC CAC TGC CAG GAT ATC 1008
Cys Pro Gln Gly Tyr Lys Arg Leu Asn Ser Thr His Cys Gln Asp Ile
325 330 335

AAC GAA TGT GCG ATG CCC GGG AAT GTG TGC CAT GGT GAC TGC CTC AAC 1056
Asn Glu Cys Ala Met Pro Gly Asn Val Cys His Gly Asp Cys Leu Asn
340 345 350

AAC CCT GGC TCT TAT CGC TGT GTC TGC CCG CCC GGT CAT AGC TTG GGT 1104
Asn Pro Gly Ser Tyr Arg Cys Val Cys Pro Pro Gly His Ser Leu Gly
355 360 365

CCC CTC GCA GCA CAG TGC ATT GCC GAC AAA CCA GAG GAG AAG AGC CTG 1152
Pro Leu Ala Ala Gln Cys Ile Ala Asp Lys Pro Glu Glu Lys Ser Leu
370 375 380

TGT TTC CGC CTT GTG AGC ACC GAA CAC CAG TGC CAG CAC CCT CTG ACC 1200
Cys Phe Arg Leu Val Ser Thr Glu His Gln Cys Gln His Pro Leu Thr
385 390 395 400

ACA CGC CTA ACC CGC CAG CTC TGC TGC TGT AGT GTG GGT AAA GCC TGG 1248
Thr Arg Leu Thr Arg Gln Leu Cys Cys Cys Ser Val Gly Lys Ala Trp
405 410 415

GGT GCC CGG TGC CAG CGC TGC CCG GCA GAT GGT ACA GCA GCC TTC AAG 1296
Gly Ala Arg Cys Gln Arg Cys Pro Ala Asp Gly Thr Ala Ala Phe Lys
420 425 430

GAG ATC TGC CCC GGC TGG GAA AGG GTA CCA TAT CCT CAC CTC CCA CCA 1344
Glu Ile Cys Pro Gly Trp Glu Arg Val Pro Tyr Pro His Leu Pro Pro
435 440 445

GAC GCT CAC CAT CCA GGG GGA AAG CGA CTT CTC CCT CTT CCT GCA CCC 1392
Asp Ala His His Pro Gly Gly Lys Arg Leu Leu Pro Leu Pro Ala Pro
450 455 460

GAC GGG CCA CCC AAA CCC CAG CAG CTT CCT GAA AGC CCC AGC CGA GCA 1440
Asp Gly Pro Pro Lys Pro Gln Gln Leu Pro Glu Ser Pro Ser Arg Ala
465 470 475 480

CCA CCC CTC GAG GAC ACA GAG GAA GAG AGA GGA GTG ACC ATG GAT CCA 1488
Pro Pro Leu Glu Asp Thr Glu Glu Glu Arg Gly Val Thr Met Asp Pro
485 490 495

CCA GTG AGT GAG GAG CGA TCG GTG CAG CAG AGC CAC CCC ACT ACC ACC 1536
Pro Val Ser Glu Glu Arg Ser Val Gln Gln Ser His Pro Thr Thr Thr
500 505 510

ACC TCA CCC CCC CGG CCT TAC CCA GAG CTC ATC TCT CGC CCC TCC CCA 1584
Thr Ser Pro Pro Arg Pro Tyr Pro Glu Leu Ile Ser Arg Pro Ser Pro
515 520 525

CCT ACC TTC CAC CGG TTC CTG CCA GAC TTG CCC CCA TCC CGA AGT GCA 1632
Pro Thr Phe His Arg Phe Leu Pro Asp Leu Pro Pro Ser Arg Ser Ala
530 535 540

GTG GAG ATC GCC CCC ACT CAG GTC ACA GAG ACC GAT GAG TGC CGA TTG 1680
Val Glu Ile Ala Pro Thr Gln Val Thr Glu Thr Asp Glu Cys Arg Leu
545 550 555 560

AAC CAG AAT ATC TGT GGC CAT GGA CAG TGT GTG CCT GGC CCC TCG GAT 1728
Asn Gln Asn Ile Cys Gly His Gly Gln Cys Val Pro Gly Pro Ser Asp
565 570 575

TAC TCC TGC CAC TGC AAC GCT GGC TAC CGG TCA CAC CCG CAG CAC CGC 1776
Tyr Ser Cys His Cys Asn Ala Gly Tyr Arg Ser His Pro Gln His Arg
580 585 590

TAC TGT GTT GAT GTG AAC GAG TGC GAG GCA GAG CCC TGC GGC CCC GGG 1824
Tyr Cys Val Asp Val Asn Glu Cys Glu Ala Glu Pro Cys Gly Pro Gly
595 600 605

AAA GGC ATC TGT ATG AAC ACT GGT GGC TCC TAC AAT TGT CAC TGC AAC 1872
Lys Gly Ile Cys Met Asn Thr Gly Gly Ser Tyr Asn Cys His Cys Asn
610 615 620

CGA GGC TAC CGC CTC CAC GTG GGT GCA GGG GGC CGC TCG TGC GTG GAC 1920
Arg Gly Tyr Arg Leu His Val Gly Ala Gly Gly Arg Ser Cys Val Asp
625 630 635 640

CTG AAC GAG TGC GCC AAG CCT CAC CTG TGT GGG GAC GGT GGC TTC TGC 1968
Leu Asn Glu Cys Ala Lys Pro His Leu Cys Gly Asp Gly Gly Phe Cys
645 650 655

ATC AAC TTC CCT GGT CAC TAC AAA TGC AAC TGC TAT CCT GGC TAC CGG 2016
Ile Asn Phe Pro Gly His Tyr Lys Cys Asn Cys Tyr Pro Gly Tyr Arg
660 665 670

CTC AAG GCC TCC CGA CCG CCC ATT TGC GAA GAC ATC GAC GAG TGT CGC 2064
Leu Lys Ala Ser Arg Pro Pro Ile Cys Glu Asp Ile Asp Glu Cys Arg
675 680 685

GAC CCT AGC ACC TGC CCT GAT GGC AAA TGT GAA AAC AAA CCT GGC AGC 2112
Asp Pro Ser Thr Cys Pro Asp Gly Lys Cys Glu Asn Lys Pro Gly Ser
690 695 700

TTC AAG TGC ATC GCC TGC CAG CCT GGC TAC CGT AGC CAG GGG GGC GGG 2160
Phe Lys Cys Ile Ala Cys Gln Pro Gly Tyr Arg Ser Gln Gly Gly Gly
705 710 715 720

GCC TGT CGT GAT GTC AAC GAA TGC TCC GAG GGT ACC CCC TGC TCT CCT 2208
Ala Cys Arg Asp Val Asn Glu Cys Ser Glu Gly Thr Pro Cys Ser Pro
725 730 735






GGA TGG TGT GAG AAC CTT CCG GGT TCT TAC CGT TGC ACG TGT GCC CAG 2256
Gly Trp Cys Glu Asn Leu Pro Gly Ser Tyr Arg Cys Thr Cys Ala Gln
740 745 750

GGG ATA CGA ACC CGC ACA GGA CGC CTC AGT TGC ATA GAC GTG GAT GAG 2304
Gly Ile Arg Thr Arg Thr Gly Arg Leu Ser Cys Ile Asp Val Asp Glu
755 760 765

TGT GAG GCT GGG AAA GTG TGC CAA GAT GGC ATC TGC ACG AAC ACA CCA 2352
Cys Glu Ala Gly Lys Val Cys Gln Asp Gly Ile Cys Thr Asn Thr Pro
770 775 780

GGC TCT TTC CAG TGT CAG TGC CTC TCC GGC TAT CAT CTG TCA AGG GAT 2400
Gly Ser Phe Gln Cys Gln Cys Leu Ser Gly Tyr His Leu Ser Arg Asp
785 790 795 800

CGG AGC CGC TGT GAG GAC ATT GAT GAA TGT GAC TTC CCT GCG GCC TGC 2448
Arg Ser Arg Cys Glu Asp Ile Asp Glu Cys Asp Phe Pro Ala Ala Cys
805 810 815

ATC GGG GGT GAC TGC ATC AAT ACC AAT GGT TCC TAC AGA TGT CTC TGT 2496
Ile Gly Gly Asp Cys Ile Asn Thr Asn Gly Ser Tyr Arg Cys Leu Cys
820 825 830

CCC CTG GGT CAT CGG TTG GTG GGC GGC AGG AAG TGC AAG AAA GAT ATA 2544
Pro Leu Gly His Arg Leu Val Gly Gly Arg Lys Cys Lys Lys Asp Ile
835 840 845

GAT GAG TGC AGC CAG GAC CCA GGC CTG TGC CTG CCC CAT GCC TGC GAG 2592
Asp Glu Cys Ser Gln Asp Pro Gly Leu Cys Leu Pro His Ala Cys Glu
850 855 860

AAC CTC CAG GGC TCC TAT GTC TGT GTC TGT GAT GAG GGT TTC ACA CTC 2640
Asn Leu Gln Gly Ser Tyr Val Cys Val Cys Asp Glu Gly Phe Thr Leu
865 870 875 880

ACC CAG GAC CAG CAT GGG TGT GAG GAG GTG GAG CAG CCC CAC CAC AAG 2688
Thr Gln Asp Gln His Gly Cys Glu Glu Val Glu Gln Pro His His Lys
885 890 895



AAG GAG TGC TAC CTT AAC TTC GAT GAC ACA GTG TTC TGT GAC AGC GTA 2736
Lys Glu Cys Tyr Leu Asn Phe Asp Asp Thr Val Phe Cys Asp Ser Val
900 905 910

TTG GCT ACC AAT GTC ACT CAG CAG GAA TGC TGT TGC TCT CTG GGA GCT 2784
Leu Ala Thr Asn Val Thr Gln Gln Glu Cys Cys Cys Ser Leu Gly Ala
915 920 925

GGC TGG GGA GAC CAC TGC GAA ATC TAT CCC TGT CCA GTC TAC AGC TCA 2832
Gly Trp Gly Asp His Cys Glu Ile Tyr Pro Cys Pro Val Tyr Ser Ser
930 935 940

GCC GAA TTT CAC AGC CTG GTG CCT GAT GGG AAA AGG CTA CAC TCA GGA 2880
Ala Glu Phe His Ser Leu Val Pro Asp Gly Lys Arg Leu His Ser Gly
945 950 955 960






CAA CAA CAT TGT GAA CTA TGC ATT CCT GCC CAC CGT GAC ATC GAC GAA 2928
Gln Gln His Cys Glu Leu Cys Ile Pro Ala His Arg Asp Ile Asp Glu
965 970 975

TGC ATA TTG TTT GGG GCA GAG ATC TGC AAG GAG GGC AAG TGT GTG AAC 2976
Cys Ile Leu Phe Gly Ala Glu Ile Cys Lys Glu Gly Lys Cys Val Asn
980 985 990

ACG CAG CCC GGC TAC GAG TGC TAC TGC AAG CAG GGC TTC TAC TAC GAT 3024
Thr Gln Pro Gly Tyr Glu Cys Tyr Cys Lys Gln Gly Phe Tyr Tyr Asp
995 1000 1005

GGC AAC CTG CTG GAG TGC GTG GAC GTG GAT GAG TGC TTG GAT GAG TCT 3072
Gly Asn Leu Leu Glu Cys Val Asp Val Asp Glu Cys Leu Asp Glu Ser
1010 1015 1020

AAC TGC AGG AAC GGA GTG TGT GAG AAC ACA CGT GGC GGC TAC CGC TGT 3120
Asn Cys Arg Asn Gly Val Cys Glu Asn Thr Arg Gly Gly Tyr Arg Cys
1025 1030 1035 1040

GCC TGC ACT CCG CCG GCA GAG TAC AGT CCC GCA CAG GCC CAG TGT CTG 3168
Ala Cys Thr Pro Pro Ala Glu Tyr Ser Pro Ala Gln Ala Gln Cys Leu
1045 1050 1055

ATC CCG GAG AGA TGG AGC ACG CCC CAG AGA GAC GTG AAG TGT GCT GGG 3216
Ile Pro Glu Arg Trp Ser Thr Pro Gln Arg Asp Val Lys Cys Ala Gly
1060 1065 1070

CCC AGC GAG GAG AGG ACG GCA TGT GTA TGG GGC CCC TGG GCG GGA CCT 3264
Ala Ser Glu Glu Arg Thr Ala Cys Val Trp Gly Pro Trp Ala Gly Pro
1075 1080 1085

GCC CTC ACT TTT GAT GAC TGC TGC TGC CGC CAG CCG CGG CTG GGT ACC 3312
Ala Leu Thr Phe Asp Asp Cys Cys Cys Arg Gln Pro Arg Leu Gly Thr
1090 1095 1100

CAG TGC AGA CCG TGC CCG CCA CGT GGC ACC GGG TCC CAG TGC CCG ACT 3360
Gln Cys Arg Pro Cys Pro Pro Arg Gly Thr Gly Ser Gln Cys Pro Thr
1105 1110 1115 1120

TCA CAG AGT GAG AGC AAT TCT TTC TGG GAC ACA AGC CCC CTG CTA CTG 3408
Ser Gln Ser Glu Ser Asn Ser Phe Trp Asp Thr Ser Pro Leu Leu Leu
1125 1130 1135

GGG AAG TCT CCG CGA GAC GAA GAC AGC TCA GAG GAG GAT TCA GAT GAG 3456
Gly Lys Ser Pro Arg Asp Glu Asp Ser Ser Glu Glu Asp Ser Asp Glu
1140 1145 1150

TGC CGT TGT GTG AGC GGA CGC TGT GTG CCA CGG CCA GGC GGG GCG GTA 3504
Cys Arg Cys Val Ser Gly Arg Cys Val Pro Arg Pro Gly Gly Ala Val
1155 1160 1165

TCC GAG TGT CCT GGA GGC TTT CAG CTG GAC GCC TCC CGT GCC CGC TGC 3552
Cys Glu Cys Pro Gly Gly Phe Gln Leu Asp Ala Ser Arg Ala Arg Cys
1170 1175 1180



GTG GAC ATT GAT GAG TGC CGA GAA CTG AAC CAG CGG GGA CTG CTG TGT 3600
Val Asp Ile Asp Glu Cys Arg Glu Leu Asn Gln Arg Gly Leu Leu Cys
1185 1190 1195 1200

AAG AGC GAG CGG TGC GTG AAC ACC AGT GGA TCC TTC CGC TGT GTC TGC 3648
Lys Ser Glu Arg Cys Val Asn Thr Ser Gly Ser Phe Arg Cys Val Cys
1205 1210 1215

AAA GCT GGC TTC ACG CGC AGC CGC CCT CAC GGG CCT GCG TGC CTC AGC 3696
Lys Ala Gly Phe Thr Arg Ser Arg Pro His Gly Pro Ala Cys Leu Ser
1220 1225 1230

GCC GCC GCT GAT GAT GCA GCC ATA GCC CAC ACC TCA GTG ATC GAT CAT 3744
Ala Ala Ala Asp Asp Ala Ala Ile Ala His Thr Ser Val Ile Asp His
1235 1240 1245

CGA GGG TAT TTT CAC 3759
Arg Gly Tyr Phe His
1250



(2) INFORMATION FOR SEQ ID NO: 4: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1253 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Arg Gln Ala Gly Gly Leu Gly Leu Leu Ala Leu Leu Leu Leu Ala
1 5 10 15

Leu Leu Gly Pro Gly Gly Arg Gly Val Gly Arg Pro Gly Ser Gly Ala
20 25 30

Gln Ala Gly Ala Gly Arg Trp Ala Gln Arg Phe Lys Val Val Phe Ala
35 40 45

Pro Val Ile Cys Lys Arg Thr Cys Leu Lys Gly Gln Cys Arg Asp Ser
50 55 60

Cys Gln Gln Gly Ser Asn Met Thr Leu Ile Gly Glu Asn Gly His Ser
65 70 75 80

Thr Asp Thr Leu Thr Gly Ser Ala Phe Arg Val Val Val Cys Pro Leu
85 90 95

Pro Cys Met Asn Gly Gly Gln Cys Ser Ser Arg Asn Gln Cys Leu Cys
100 105 110

Pro Pro Asp Phe Thr Gly Arg Phe Cys Gln Val Pro Ala Ala Gly Thr
115 120 125

Gly Ala Gly Thr Gly Ser Ser Gly Pro Gly Trp Pro Asp Arg Ala Met
130 135 140



Ser Thr Gly Pro Leu Pro Pro Leu Ala Pro Glu Gly Glu Ser Val Ala
145 150 155 160

Ser Lys His Ala Ile Tyr Ala Val Gln Val Ile Ala Asp Pro Pro Gly
165 170 175

Pro Gly Glu Gly Pro Pro Ala Gln His Ala Ala Phe Leu Val Pro Leu
180 185 190

Gly Pro Gly Gln Ile Ser Ala Glu Val Gln Ala Pro Pro Pro Val Val
195 200 205

Asn Val Arg Val His His Pro Pro Glu Ala Ser Val Gln Val His Arg
210 215 220

Ile Glu Gly Pro Asn Ala Glu Gly Pro Ala Ser Ser Gln His Leu Leu
225 230 235 240

Pro His Pro Lys Pro Pro His Pro Arg Pro Pro Thr Gln Lys Pro Leu
245 250 255

Gly Arg Cys Phe Gln Asp Thr Leu Pro Lys Gln Pro Cys Gly Ser Asn
260 265 270



Pro Leu Pro Gly Leu Thr Lys Gln Glu Asp Cys Cys Gly Ser Ile Gly
275 280 285

Thr Ala Trp Gly Gln Ser Lys Cys His Lys Cys Pro Gln Leu Gln Tyr
290 295 300

Thr Gly Val Gln Lys Pro Val Pro Val Arg Gly Glu Val Gly Ala Asp
305 310 315 320

Cys Pro Gln Gly Tyr Lys Arg Leu Asn Ser Thr His Cys Gln Asp Ile
325 330 335

Asn Glu Cys Ala Met Pro Gly Asn Val Cys His Gly Asp Cys Leu Asn
340 345 350

Asn Pro Gly Ser Tyr Arg Cys Val Cys Pro Pro Gly His Ser Leu Gly
355 360 365

Pro Leu Ala Ala Gln Cys Ile Ala Asp Lys Pro Glu Glu Lys Ser Leu
370 375 380

Cys Phe Arg Leu Val Ser Thr Glu His Gln Cys Gln His Pro Leu Thr
385 390 395 400

Thr Arg Leu Thr Arg Gln Leu Cys Cys Cys Ser Val Gly Lys Ala Trp
405 410 415

Gly Ala Arg Cys Gln Arg Cys Pro Ala Asp Gly Thr Ala Ala Phe Lys
420 425 430

Glu Ile Cys Pro Gly Trp Glu Arg Val Pro Tyr Pro His Leu Pro Pro
435 440 445

Asp Ala His His Pro Gly Gly Lys Arg Leu Leu Pro Leu Pro Ala Pro
450 455 460

Asp Gly Pro Pro Lys Pro Gln Gln Leu Pro Glu Ser Pro Ser Arg Ala
465 470 475 480



Pro Pro Leu Glu Asp Thr Glu Glu Glu Arg Gly Val Thr Met Asp Pro
485 490 495

Pro Val Ser Glu Glu Arg Ser Val Gln Gln Ser His Pro Thr Thr Thr
500 505 510

Thr Ser Pro Pro Arg Pro Tyr Pro Glu Leu Ile Ser Arg Pro Ser Pro
515 520 525

Pro Thr Phe His Arg Phe Leu Pro Asp Leu Pro Pro Ser Arg Ser Ala
530 535 540

Val Glu Ile Ala Pro Thr Gln Val Thr Glu Thr Asp Glu Cys Arg Leu
545 550 555 560

Asn Gln Asn Ile Cys Gly His Gly Gln Cys Val Pro Gly Pro Ser Asp
565 570 575

Tyr Ser Cys His Cys Asn Ala Gly Tyr Arg Ser His Pro Gln His Arg
580 585 590

Tyr Cys Val Asp Val Asn Glu Cys Glu Ala Glu Pro Cys Gly Pro Gly
595 600 605

Lys Gly Ile Cys Met Asn Thr Gly Gly Ser Tyr Asn Cys His Cys Asn
610 615 620

Arg Gly Tyr Arg Leu His Val Gly Ala Gly Gly Arg Ser Cys Val Asp
625 630 635 640



Leu Asn Glu Cys Ala Lys Pro His Leu Cys Gly Asp Gly Gly Phe Cys
645 650 655

Ile Asn Phe Pro Gly His Tyr Lys Cys Asn Cys Tyr Pro Gly Tyr Arg
660 665 670

Leu Lys Ala Ser Arg Pro Pro Ile Cys Glu Asp Ile Asp Glu Cys Arg
675 680 685

Asp Pro Ser Thr Cys Pro Asp Gly Lys Cys Glu Asn Lys Pro Gly Ser
690 695 700

Phe Lys Cys Ile Ala Cys Gln Pro Gly Tyr Arg Ser Gln Gly Gly Gly
705 710 715 720

Ala Cys Arg Asp Val Asn Glu Cys Ser Glu Gly Thr Pro Cys Ser Pro
725 730 735

Gly Trp Cys Glu Asn Leu Pro Gly Ser Tyr Arg Cys Thr Cys Ala Gln
740 745 750



Gly Ile Arg Thr Arg Thr Gly Arg Leu Ser Cys Ile Asp Val Asp Glu
755 760 765

Cys Glu Ala Gly Lys Val Cys Gln Asp Gly Ile Cys Thr Asn Thr Pro
770 775 780

Gly Ser Phe Gln Cys Gln Cys Leu Ser Gly Tyr His Leu Ser Arg Asp
785 790 795 800

Arg Ser Arg Cys Glu Asp Ile Asp Glu Cys Asp Phe Pro Ala Ala Cys
805 810 815

Ile Gly Gly Asp Cys Ile Asn Thr Asn Gly Ser Tyr Arg Cys Leu Cys
820 825 830

Pro Leu Gly His Arg Leu Val Gly Gly Arg Lys Cys Lys Lys Asp Ile
835 840 845

Asp Glu Cys Ser Gln Asp Pro Gly Leu Cys Leu Pro His Ala Cys Glu
850 855 860

Asn Leu Gln Gly Ser Tyr Val Cys Val Cys Asp Glu Gly Phe Thr Leu
865 870 875 880

Thr Gln Asp Gln His Gly Cys Glu Glu Val Glu Gln Pro His His Lys
885 890 895

Lys Glu Cys Tyr Leu Asn Phe Asp Asp Thr Val Phe Cys Asp Ser Val
900 905 910



Leu Ala Thr Asn Val Thr Gln Gln Glu Cys Cys Cys Ser Leu Gly Ala
915 920 925

Gly Trp Gly Asp His Cys Glu Ile Tyr Pro Cys Pro Val Tyr Ser Ser
930 935 940

Ala Glu Phe His Ser Leu Val Pro Asp Gly Lys Arg Leu His Ser Gly
945 950 955 960

Gln Gln His Cys Glu Leu Cys Ile Pro Ala His Arg Asp Ile Asp Glu
965 970 975

Cys Ile Leu Phe Gly Ala Glu Ile Cys Lys Glu Gly Lys Cys Val Asn
980 985 990

Thr Gln Pro Gly Tyr Glu Cys Tyr Cys Lys Gln Gly Phe Tyr Tyr Asp
995 1000 1005

Gly Asn Leu Leu Glu Cys Val Asp Val Asp Glu Cys Leu Asp Glu Ser
1010 1015 1020

Asn Cys Arg Asn Gly Val Cys Glu Asn Thr Arg Gly Gly Tyr Arg Cys
1025 1030 1035 1040

Ala Cys Thr Pro Pro Ala Glu Tyr Ser Pro Ala Gln Ala Gln Cys Leu
1045 1050 1055

Ile Pro Glu Arg Trp Ser Thr Pro Gln Arg Asp Val Lys Cys Ala Gly
1060 1065 1070

Ala Ser Glu Glu Arg Thr Ala Cys Val Trp Gly Pro Trp Ala Gly Pro
1075 1080 1085

Ala Leu Thr Phe Asp Asp Cys Cys Cys Arg Gln Pro Arg Leu Gly Thr
1090 1095 1100

Gln Cys Arg Pro Cys Pro Pro Arg Gly Thr Gly Ser Gln Cys Pro Thr
1105 1110 1115 1120

Ser Gln Ser Glu Ser Asn Ser Phe Trp Asp Thr Ser Pro Leu Leu Leu
1125 1130 1135

Gly Lys Ser Pro Arg Asp Glu Asp Ser Ser Glu Glu Asp Ser Asp Glu
1140 1145 1150

Cys Arg Cys Val Ser Gly Arg Cys Val Pro Arg Pro Gly Gly Ala Val
1155 1160 1165

Cys Glu Cys Pro Gly Gly Phe Gln Leu Asp Ala Ser Arg Ala Arg Cys
1170 1175 1180

Val Asp Ile Asp Glu Cys Arg Glu Leu Asn Gln Arg Gly Leu Leu Cys
1185 1190 1195 1200

Lys Ser Glu Arg Cys Val Asn Thr Ser Gly Ser Phe Arg Cys Val Cys
1205 1210 1215

Lys Ala Gly Phe Thr Arg Ser Arg Pro His Gly Pro Ala Cys Leu Ser
1220 1225 1230

Ala Ala Ala Asp Asp Ala Ala Ile Ala His Thr Ser Val Ile Asp His
1235 1240 1245

Arg Gly Tyr Phe His
1250



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Gly Glu Ser Val Ala Ser Lys His Ala Ile Tyr Ala Val Cys
1 5 10



(2) INFORMATION FOR SEQ ID NO: 6: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
TACCGATGCT ACCGCAGCAA TCTT 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
ATGCCTAAAC TCTACCAGCA CG

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

GAGTCACGTC ATCCATTCCA CA

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:
CGTCCAAGTT GTGTCTTAGC AG 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 






(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Gly Pro Pro Gly Pro Gln Gly Ala Thr Gly Pro Leu Gly Pro Lys Gly
1 5 10 15

Gln Thr Gly Glu Pro Gly Ile Ala Gly Phe Lys Gly Glu Gln Gly Pro
20 25 30

Lys Gly Glu Thr Gly Pro Ala Gly Pro Gln Gly Ala Pro Gly Pro Ala
35 40 45

Gly Glu Glu Gly Lys 
50 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



GGCCCTCCCG GTCCTCAAGG TGCAACTGGT CCTCTGGGCC CCAAAGGTCA GACGGGTGAG 60

CCCGGCATCG CTGGCTTCAA AGGTGAACAA GGCCCCAAGG GAGAGACTGG ACCTGCTGGG 120

CCCCAGGGAG CCCCTGGCCC TGCTGGTGAA GAAGGAAAA 159



(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: modified_base

(B) LOCATION: 16 

(D) OTHER INFORMATION: /note= "N = A or G or C or T" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
AAACGTCACA CGTGANACGT GAACGTTGCT TGCTGG 36 

(2) INFORMATION FOR SEQ ID NO: 13: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:
TTACGTCCAC GTACACGTCT AGCAAGCAAG CA 



